Feature selection, statistical modeling and its applications to universal JPEG steganalyzer
Abstract
Steganalysis is the task of identifying media that carry a hidden message, that is, a communication whose very existence is concealed. This research focuses on steganalysis of JPEG images, because the format is ubiquitous and has low bandwidth requirements for storage and transmission. JPEG image steganalysis is generally addressed by representing an image with lower-dimensional features, such as statistical properties, and then training a classifier on the feature set to differentiate between innocent and stego images. Our approach is twofold. First, we propose a new feature reduction technique that uses the Mahalanobis distance to rank features for steganalysis. Many successful steganalysis algorithms suffer from the "curse of dimensionality": they use a large number of features relative to the size of the training set. We apply this technique to the state-of-the-art steganalyzer proposed by Tomás Pevný (54) to understand the complexity of the feature space and the effectiveness of individual features for steganalysis. We show that, using our approach, reduced-feature steganalyzers can be obtained that perform as well as the original steganalyzer. Second, based on our experimental observations, we propose a new modeling technique for steganalysis: we develop a Partially Ordered Markov Model (POMM) (23) for JPEG images and use its properties to train a Support Vector Machine. The POMM generalizes the concept of local neighborhood directionality by using a partial order underlying the pixel locations. We show that the proposed steganalyzer outperforms a state-of-the-art steganalyzer by testing our approach on many different image databases, totaling 20000 images. Finally, we provide a software package with a graphical user interface, developed to make this research accessible to local state forensic departments.
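The abstract does not spell out how the Mahalanobis distance is used to rank features, so the following is only a minimal sketch under a stated assumption: each feature is scored on its own by the one-dimensional Mahalanobis distance between the innocent (cover) and stego class means, using a pooled variance. The function names, the `eps` guard, and the synthetic data are hypothetical and are not taken from the original work.

```python
import numpy as np


def rank_features_by_mahalanobis(cover, stego, eps=1e-12):
    """Rank feature columns by a per-feature (1-D) Mahalanobis distance.

    cover, stego: arrays of shape (n_samples, n_features), one row per image.
    Returns (order, scores); `order` lists feature indices from the most
    to the least class-separating feature.
    ASSUMPTION: features are scored independently with a pooled variance;
    the paper may use a different (e.g. multivariate) formulation.
    """
    cover = np.asarray(cover, dtype=float)
    stego = np.asarray(stego, dtype=float)
    n_c, n_s = cover.shape[0], stego.shape[0]

    # Difference between the class means, one value per feature.
    mean_gap = cover.mean(axis=0) - stego.mean(axis=0)

    # Pooled unbiased variance of the two classes, one value per feature.
    pooled_var = ((n_c - 1) * cover.var(axis=0, ddof=1)
                  + (n_s - 1) * stego.var(axis=0, ddof=1)) / (n_c + n_s - 2)

    # In one dimension the Mahalanobis distance is |mean gap| / std.
    scores = np.abs(mean_gap) / np.sqrt(pooled_var + eps)
    order = np.argsort(-scores)  # descending by score
    return order, scores


def demo():
    """Synthetic check: only feature 3 differs between the two classes."""
    rng = np.random.default_rng(0)
    cover = rng.normal(0.0, 1.0, size=(500, 5))
    stego = rng.normal(0.0, 1.0, size=(500, 5))
    stego[:, 3] += 2.0  # hypothetical embedding artifact
    return rank_features_by_mahalanobis(cover, stego)


if __name__ == "__main__":
    order, scores = demo()
    print("features ranked from most to least separating:", order)
```

A reduced-feature steganalyzer in this spirit would keep only the first k indices of `order` before training the classifier; how k is chosen is left open here.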
Similar resources
Recent Advances in Information Processing & Intelligent Information Systems and Applications - Track on Multimedia
In this paper, we propose a novel blind steganalytic scheme able to detect JPEG stego images embedded with several known steganographic programs. By estimating the original image of the given image, thirteen types of statistics are collected in the DCT domain and the decompressed spatial domain. Then we calculate the histogram characteristic function (HCF) and the center of mass (COM) for each ...
An Optimized Low Volume Blind Universal Steganalyzer with improved Generalization
Background: Generic Steganalysis proves to be a boon when there is a suspicion of covert channels with no other information regarding stego images. With the advent of sophisticated steganographic techniques, the process becomes tough as the hidden data is very meager and leaves undecipherable artifacts. A Universal, Blind and Statistical Steganalyzer needs to be more generalized as it encounter...
An Overview of the New Feature Selection Methods in Finite Mixture of Regression Models
Variable (feature) selection has attracted much attention in contemporary statistical learning and recent scientific research. This is mainly due to the rapid advancement in modern technology that allows scientists to collect data of unprecedented size and complexity. One type of statistical problem in such applications is concerned with modeling an output variable as a function of a sma...
A Blind Steganalytic Scheme Based on DCT and Spatial Domain for JPEG Images
In this paper, we propose a novel blind steganalytic scheme able to detect JPEG stego images embedded with several known steganographic programs. By estimating the original image of the given image, thirteen types of statistics are collected in the DCT domain and the decompressed spatial domain. Then we calculate the histogram characteristic function (HCF) and the center of mass (COM) for each ...
Influence of embedding strategies on security of steganographic methods in the JPEG domain
In this paper, we study how specific design principles and elements of steganographic schemes for the JPEG format influence their security. Our goal is to shed some light on how the choice of the embedding operation and domain, adaptive selection channels, and syndrome coding influence statistical detectability. In the experimental part of this paper, the detectability is evaluated using a stat...
Publication date: 2015